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This is a tutorial for the many-worlds theory by Everett, which includes some of my personal 
views. It has two main parts.The first main part shows the emergence of many worlds in a universe 


consisting of only a Mach-Zehnder interferometer. 


The second main part is an abridgment of 


Everett’s long thesis, where his theory was originally elaborated in detail with clarity and rigor. 
Some minor comments are added in the abridgment in light of recent developments. Even if you do 
not agree to Everett’s view, you will still learn a great deal from his generalization of the uncertainty 
relation, his unique way of defining entanglement (or canonical correlation), his formulation of 
quantum measurement using Hamiltonian, and his relative state. 


Part I 
Prologue 


Although Everett’s many-worlds theory is now well 
known, it is still a minority view among physicists. 
There are many reasons, which I have no intension to 
discuss extensively here. One of the reasons may be 
that many physicists have not read his work seriously. 
Everett’s theory was presented in his PhD thesis, which 
has two versions. The long version has over 130 pages 
and was finished in 1956. It was published only 17 years 
later for the first time with the title The Theory of The 
Universal Wave Function in the book edited by DeWitt 
and Graham [I] and was re-published with commentary 
in 2012 [2]. Due to Bohr’s objection, Everett had to 
shorten it. The short version became his official PhD 
thesis at Princeton University [3] and was published 
with the title “Relative State” Formulation of Quantum 
Mechanics in Review of Modern Physics [4] accompanied 
by an article by his advisor Wheeler {5}. 


On the one hand, Everett’s short thesis lacks many 
important results in his long thesis, e.g., entanglement 
(or canonical correlation) and formulation of quantum 
measurement. On the other hand, the long version may 
be too long for many people’s patience, which is further 
exasperated by Everett’s mathematical notations that 
are not familiar to modern readers. It is my hope that 
this abridgment makes a good compromise between the 
long and short thesis. In this abridged version, I will 
keep its structure and stick to Everett’s original state- 
ments as much as possible at key points while omitting 
detailed discussion and derivations. Entanglement is 
all over the long thesis. However, Everett never used 
the word entanglement; instead, he called it canonical 
correlation or simply correlation. I will use entanglement 
in this abridgment. In addition, I'll use Dirac brackets 
wherever possible. 


Before the abridgment, I use the Mach-Zehnder inter- 
ferometer (MZI) to illustrate the many-worlds theory. 
It appears to me that the MZI is a simple example 
to illustrate all the essential points in Everett’s long 
thesis. In particular, the MZI is ideal to demonstrate 
interference between different worlds and the essence of 
approximate measurement, which was discussed in detail 
by Everett in his long thesis and has not been discussed 
much since. Near the end of this part, I also discuss 
the issue of preferred basis and I think that it is related 
to the perceptive abilities of observers. Hopefully, this 
example of MZI will aid your reading of the abridgment. 
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FIG. 1: Mach-Zehnder interferometer. 14, and M3 are half- 
silvered mirrors; Mz and Mz, are reflective mirrors. S is some 
sample material that causes a phase shift to the photon’s wave 
function. Dy and Dy are two photon detectors. Note that 
M, and M3 are not symmetric and their two sides are made 
of materials of different refractive indices. 


The two words, universe and world, are often used 
differently by different people when they discuss Ev- 
erett’s theory. It was first dubbed “many-worlds” 
theory by DeWitt [I]. In this way, we say that there 
is one universe that consists of many different worlds. 
However, Everett’s theory has recently often been called 
the theory of multiverse. In this way, we say that there 
is one world that consists of many different universes [6]. 


It rubs salt to the wound that multiverse has different 
meanings for different people in literature [7 [8]. So, to 
avoid the confusion, we stick to DeWitt’s term and say 
that there is one universe that consists of many different 
worlds. 


It is my sincere hope that you will eventually find time 
to read Everett’s long thesis in its entirety, which is richer 
in content than the short version and juicier than this 
abridgment. Finally, even if you do not agree to his view, 
you will certainly get entertained and inspired by how 
Everett generalized uncertainty relation, defined entan- 
glement (or canonical correlation), formulated quantum 
measurement, and introduced relative state. 


Part II 
The universe of MZI 


The Mach-Zehnder interferometer (MZI) was proposed 
by Zehnder in 1891 [9] and was refined by Mach in 
1892 [i0]. Due to its simplicity and flexibility, the MZI 
has not only enjoyed wide applications [IJ] but also of- 
ten been used for illustration of fundamental subtleties 
in quantum mechanics [6} [12]. We follow the crowd and 
use it to illustrate Everett’s many-worlds theory. We first 
briefly review the Mach-Zehnder interferometer (MZI) in 
a conventional way. 

As shown in Fig{T] MZI consists of four mirrors. The 
mirrors M, and M3 are half-silvered and serve as beam- 
splitter. The mirrors Mz and My are reflective. Initially 
the photon’s wave function has only the horizontal com- 
ponent, i.e., |Wo) = |dx). After the photon encounters 
M,, its wave function splits and has two components 


1 

v2 
Note that there is an ambiguity for the phase difference 
between the transmitted component |¢#) and the reflec- 
tion component |dy). In general, the phase difference 
depends on the incident side of the beam splitter. If 61 
is the phase difference when the photon is incident on 
the left side of the beam splitter and 42 is the phase 
difference when the photon is incident on the right side, 
then 6; + 69 = 7 [13]. The exact values of 6; and 62 
depend on how the beam splitter is manufactured. In 
the above we have used 6, = 7 and dg = 0. This occurs 
when the two sides of the beam splitter are made of 
materials of different refractive indices [14]. With this 
choice of phase differences, the beam splitter functions 
exactly as a Hadamard gate. 


21) (lex) — |év))- (0.1) 


The sample S' causes a phase shift @ to the wave func- 
tion that passes through; the reflections by the mirrors 
Mp2 and My, interchange |¢;;) and |¢y) and cause a 7 
phase shift. As a result, right before encountering the 


mirror M3, the photon’s wave function becomes 
1 
V2 


At the mirror M3, |¢z) further splits into two compo- 
nents becomes (|¢#) + |¢v))/V2 and |@v) splits into 
(—|¢x) +|év))/V2. Consequently, we have 


|b2) 


1) = s(e" lox) — |dv)). (0.2) 


s le (Ix) + |¢v)) + |¢x) — lov) | 


i 6 wae @ 
= e'9/2 (cos 3 lox) +isin 5 lov) ). (0.3) 
This means that the probability that the photon be 
detected by the detector Dy is cos*(@/2) and the 
probability detected by the detector Dy is sin?(0/2). 


The mysterious part of the MZI is the following. On 
the one hand, the photon wave function has two parts, 
lox) and |dy), right before the detection. On the other 
hand, in a single run of the experiment, there is only 
one detection either at Dy or Dy. Suppose that Dy 
detects a photon in one experiment; this detection is 
clearly triggered by the |édq) term in Eq. (0.3). So, 
why does not the other term |éy) trigger a detection 
at Dy? What has happened to |dy)? According to 
the conventional view , both |¢;7) and |¢y) can trigger 
detection, but it is purely random which one triggers. 
Furthermore, when one of them triggers detection, the 
other part magically disappears. This is called the 
collapse of wave function. Everett showed in details the 
collapse of wave function would lead to two difficulties 
in his long thesis [I]. The first difficulty is that it would 
lead to logical inconsistency when there are two or more 
observers; the second difficulty is that it is inadequate 
to deal with approximate measurement. 


In both his long thesis and short thesis [I] [4], Ev- 
erett had used branches or just elements of superposition 
instead of worlds referring to the different superposition 
components in a wave function. Here in this work, we 
will often use worlds as his theory is now widely known 
as the many-worlds theory [I]. 


We now analyze MZI with the many-worlds theory. 
We assume that the universe consists only of MZI and 
nothing else. There is no gravity. The two detectors can 
absorb the photon with 100% efficiency. The mirrors 
My, and Mz are at rest initially and arranged as in Fig[I] 
with no support or attached wires while both the mirrors 
Mz and My are fixed in space. With mirrors M; and M3 
movable, we can discuss the conditions for interference 
to occur. If one consider a more complicated situation 
where neither of the mirrors Mz and My, are fixed, the 
analysis would become much more complicated without 
gaining essentially new physics. 


There are two different kinds of interactions in this 
universe of MZI: photon with half-silvered mirror and 


photon with the reflective mirror. We use U, denote the 
former and Up the latter. The interaction at the mirror 
M, can be mathematically expressed as 


Ue lox) ® Ivo") 
= 5 (leu) @ Wal) — lov) @ 18h). (Ox) 


where |y/") and 0") are the states of the mirror before 
and after the interaction, respectively. After the interac- 
tion, if the photon continues to move horizontally, noth- 
ing changes; if the photon moves vertically, the mirror 
acquires a momentum ) and its state becomes |)h!). 
Overall, it results an entangled state between the photon 
and the mirror. Similarly, at the mirror M3, we have 


Uc |ox) ® |U9*) 


= 5 (leu) @ Wd) + lov) @18)) . (08) 
and 
Ue |dv) @ |vo"") 
= 5 (lev) @ 103) — bn) @188)) - (06) 
Note that p; = —p3. The reflective interaction at the 
mirror Mp has the following mathematical form 
Uo lou) ® |p) =—|dv) @ vo). (0.7) 
And similarly at the mirror M4, we have 
Uolov) ® lo) =—lbx) @ lo"). (0.8) 


No entanglement is generated in this interaction and the 
mirrors do not gain momentum as they are fixed in space. 


Initially the universe of MZI is described by the follow- 
ing wave function 


|Yo) = |b) ® |[¥o"2) ® bg?) 
@ [Wo"*) ® bo") @ |WP*) @ dN”) 


Whenever there is no confusion arising, we omit © and 
simplify the above the expression as 


(0.9) 
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After the photon interacts with the mirror M1, we have 


1 
[Wa) = Uelibo) = Fe (lbs ¥a") — love vpn')) 
@ v0 ves eo wer weY). (0-11) 


According to the many-worlds theory, the two compo- 
nents in |W ), which are orthogonal to each other, repre- 
sent two different worlds: in one world the photon travels 
horizontally and in the other world the photon travels 


vertically. After the sample S, we still have two worlds 
but one world has acquired a phase shift 


1 ; 
MM) = (elon. 48") — lov, e")) 
@ We wee vd aPH,aPY) (0.12) 


The photon is then reflected by the two mirrors Mz and 
My, and the state of the universe becomes 


1 ; 
[W2) = oli) =e (du vp) — e" lov. va") 
6 a ee lhe eo (0.13) 


The universe still has only two worlds. Now the photon 
interact with the mirror M3, resulting the following state 
of the universe 


|[W3) = U.|W2) 
= 5 [lee vet uh) + dv, uh vee) 
—e* (bv, Wo. d3”) = [ou vo bi)) 
@ 97, Wo, Yodo”) - (0.14) 


It appears that there are now four worlds in the universe. 
But in general the four terms above are not orthogonal 
to each other. There are in fact seven worlds. To see it, 
let us expand |~)"1) as 


[epnt) = a gt) + Br (bt) , (0.15) 
where (7""|y¥9"") = 0. Similarly, we have 

[epa) = avs [Whg"*) + Bs lb) , (0.16) 
and 

[abpa?) = a [bo?) + 83 WY*) , (0.17) 


where (W{3|yq*) = 1 lo?) = = 0. In the above we 
have used that p; = —p3 implies |jis) = |fls*). We 
will discuss these coefficients a1 ,3 and G13 lates With 
these expansions, we have 


Is) = 5 (ea tea) lore, vb, v8) 

+(aa03 — e' ley, ae ia 

+B lou, V1, 0G 0°) + Bie" loa, at) 
+a36r lov, 01", v9") + 0183 dv, 097, 1°) 
+8183 |ov, vy", vy) 


@|d0?, 00", do" 40”) - 








(0.18) 


These seven terms are orthogonal to each other. In the 
end, the photon is detected by the detectors and we have 
a universe that consists of seven different worlds that 


exists simultaneously. And the wave function of the uni- 
verse is 
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In this final state, everything in the universe except the 
mirrors Mz and My, are entangled together. The photon 
detection should be presented as 


lou) ® Wo") > |WP*) (0.20) 


to reflect the fact that the photon is absorbed by the 
detector. However, in the above, to explicitly represent 
the photon state before the detection, we have kept dy 
and ¢y. This should not cause confusion. 











(0.19) 


We consider two special cases. In the first case, which 
we call pure interference (PI) case, ay = a3 = 1 and 
8, = 03 = 0. In the PI case, we have 
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Wp) = 5 [A +e) lar ve, es, wP* we”) 
+(1— 6) [ov va de" YP) 
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This is exactly the state in Eq. (0.3). The only difference 
is that the states of mirrors and detectors are not 
expressed explicitly in Eq. (0.3). This case happens when 
the two mirrors M, and M3 are very massive or mounted 
in space and unmovable. Fig|2{a) shows how the worlds 
split and evolve in this case. Initially, there is only one 
world and it splits into two worlds with equal weight 
at the mirror M,. These two worlds evolve in parallel 
without changing their weights before interfering at the 
mirror M3. As a result of the interference, we still have 
two worlds but with different weights. In the special 
case 9 = 0, there is only one world after the interference. 


Consider the second special case, ay = a3 = O and 


Bb, = 83 = 1. We call it pure split (PS) case. In the PS 
case, we have 
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FIG. 2: Worlds in the universe of Mach-Zehnder interferom- 
eter in two special cases. (a) The pure interference (PI) case; 
(b) the pure split (PS) case. |Wzz) and |Wy) represent worlds 
where the photon moves horizontally and vertically, respec- 
tively. The superscript d in |4,) and |U¢) represents the 
states of mirrors has changed relative to the initial state of 
the MZI universe. 
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The evolution of the worlds in this case is illustrated 
in Fig[2{b). The evolution is similar to the first case 
before the mirror M3. The crucial difference is that 
there is no interference at M3 in this case. Con- 
sequently, the worlds keep splitting and we obtain 
four different worlds with equal weights. And the phase 
shift # has no effect on the weights of the different worlds. 


(0.22) 


We now examine the expansions in Eqs. (0.15]0.16]0.17) 
in detail. The mirrors, made of atoms, have enormous 


amount of degrees of freedom. However, in this MZI uni- 
verse, only their centers of mass are relevant. Moreover, 
their centers of mass move only along p; = —p3. With 
these considerations, we are allowed to describe the states 
of the mirror M, before and after the interaction as the 
following Gaussian wave packets 


iif 1 ae = 
(old) =(—5) ep(-55), — (0.23) 
and 
1 \1/4 5 
(awa) = (<5) exp (— 55 + ike), (0.24) 


where a is the width of the wave packet and Ak = |pj|. 
We obtain 


ay = (Wo bh) = exp ( ~ 7a°k?) (0.25) 


Similarly, we can compute ag and find that 
a3 = as = a,. In real experiments, the wave 
length of the photon is much larger than the width a; 
so we have aj = a3 = a3 ~ 1. This is exactly the PI 
case in Fig Pa). One may want to use a photon with 
much shorter wave length so that a; = a3 = a3 < 1 
and 8, = 63 = 63 ~ 1, ie., the PS case. However, the 
interaction of mirrors with shorter-wave-length photon 
is very different and the MZI can consequently cease to 
work. 


There is one possible way to realize the PS case as illus- 
trated in Fig 2{b). This is to add a very sensitive detector 
DP that is capable of measuring the tiny momentum that 
a mirror gains after interacting with the photon. If the 
momentum is zero, the detector is described by |DPo); 
if the momentum is p; or 73, the detector has the state 
|DP,). These two states should be orthogonal to each 
other (DPp|DP,) = 0 to reflect the effectiveness of the 
detection. With the addition of the new detector, the 
interaction U; in Eq. (0.4) can be re-written as 


/ |bi) ® |Wo") ® |DPo) 
7 (lon) @ Wh", DP) — |v) @ WM, DP,)) . 
(0.26) 


~ Va 


Let |v") = |v", DPo) and |i) = |W, DP). We 


clearly have (Wat |r) = 0. As a result, when we ex- 


pand as in Eq. (0.15) for |p), we should have a; = 0 


and 8; = 1. Similarly, we should have a3 = 0 and (63 = 1. 
In this way, we have effectively realized the PS case in 
Fig{b), where the worlds have split twice with no inter- 
ference. Note that the discussion with the detector DP is 
a matter of principles, not for realistic realization. In real 
experiments, other methods may be used to distinguish 
the two states |W") and |y") or tell which direction 
the photon is going after encountering the mirror M,. 


We now discuss the general case. As the above analysis 
shows that a; = a3 = a and £; = $3 = 63 = B, we have 
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This wave function is a superposition of seven mutually 
orthogonal elements, each of which describe a world. In 
the order in the above equation, we call them worlds I 
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FIG. 3: Worlds in the universe of Mach-Zehnder interferom- 
eter in the general case. For clarity, the weight of each world, 
which is represented by the length of the vertical line, is not 
plotted to scale. The superscript d in |W%) and |¥¢) repre- 
sents the states of mirrors has changed relative to the initial 
state of the MZI universe. 


II, I, IV, V, VI, and VII. The worlds I and II are the 
results of interference with the world I existing only in 
the PI case while the world II existing in both the PI 
and PS cases. The worlds III, IV, and VII exist in the 
PS case. The worlds V and VI are new and do not exist 
in either of the two special cases. To understand these 
two new worlds, we expand the interaction in Eq. (0.4) 


with Eq. (0.15) 
Ue on) © 13) = <e[ lox) © 3") 


—aldv) ® ds") — Blav)@ Wt) ]. (0.28) 
The term with a represents that the photon changes its 
direction by the mirror MM, but the mirror state does 
not change. We call it reflection with no detection. 
The term with § represents that the photon changes 
its direction by the mirror M, while the mirror state 
becomes orthogonal to its original state. We call it 
reflection with detection. So, in the world V, the photon 
is reflected by the mirror M, with detection and then 
reflected by the mirror M3 with no detection; in the 
world VI, the photon is reflected by the mirror MM, with 
no detection and then reflected by the mirror M3 with 
detection. 


The analysis with the general case illustrates a crucial 
point that the photon interferes only when its different 
components, ¢y and gy, do not cause difference in the 
rest of the universe (e.g. the mirrors and detectors). 
Whenever the different components of an object’s wave 
function cause difference in other objects, interference 
disappears and decoherence occurs. 


In his long thesis, Everett offered an insight into quan- 


tum measurement. In his view, quantum measurement 
is a generation of entanglement between two subsystems 
by an interacting Hamiltonian. We now illustrate it 
with the es al generation interaction described 
in Eq. . The photon is the “apparatus” whose read- 
ing is given by the operator Aas, The eigenstates of Ap 
are 





lov)), (0.29) 
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|e) = yallew = 





such that A, = +|¢+). The system is the mirror M4, 
whose property to be measured is given by the operator 
Bm. The eigenstates of Byy are 








g(t") = |My) 


with eigenvalues being 1 and 2, respectively. Before the 
interaction between the photon and the mirror, we have 


lW12) = (0.30) 





Jon) © WS) = 5 (164) + 1d-)) © (a) + hea), (0.31) 


where there is no entanglement between the photon and 
the mirror at all. After the interaction U., we have 
Eq. (0.28). We first consider the special PS case, a = 0 
and 6 = 1. In this case, we can re-write the right hand 


side of Eq. (0.28) as 
<3 (lon) @ Ws") — |v) @ Wt") ) 


= a (e+) @ ln) +16) Ohya). (032) 
We have a maximum entanglement (a perfect correlation) 
here: when the apparatus (photon) reads ‘+’, we know 
the mirror is in the state |v); when the apparatus reads 
‘—’ the mirror is in the state |~2). In the general case, 
after the interaction U., we have 


Ue lad ® iv 


= |4) ® (a |v) + 





= |i) ) 





+ 


=" |6-)@ (Sti) +&Ius)), 
(0.33) 


where @ = (JV1+a+¥J1—a)/2. We no longer have 
a perfect measurement. When the apparatus (pho- 
ton) reads either ‘+’ or ‘—’, the mirror is not in the 
eigenstates of By, the target of our measurement. 
When a is only slightly smaller than one, the resulted 
states are very close to the eigenstates of By and can 
be regarded as an approximation. Everett call this 
kind of measurement approximate measurement. It is 
clear in the special universe of MZI the approximate 
measurement is more common than the precise measure- 
ment. It is the same in our universe, the general universe. 


Several caveats are warranted here. (1) The approxi- 
mation is not the result of noises or other random factors 


in real experimental setup. (2) The two operators A, 


and Buy are introduced for theoretical illustration; 
it seems unlikely that they can be realized in real 
experiments. (3) To the best of my knowledge, nobody 
appears to have studied approximate measurement 
thoroughly since Everett , many fundamental questions 
need to be answered, for example, the precise definition 
of approximate measurement. 


The above discussion has led us to another intriguing 
issue in quantum mechanics. We use Eq. (0.32) as an 
illustration. On the left hand side, we have a familiar 
universe that has split into two worlds: in one world, 
the photon moves horizontally and the mirror stays the 
same; in the other world, the photon moves vertically 
and the mirror changes into a state orthogonal to its 
original state. On the right hand side, the same universe 
is split to two very different worlds: in one world, the 
photon is in the state ¢,, an eigenstate of Ap, and the 
mirror is in the state ~1, an eigenstate of B,; in the 
other world, the photon is in the state 6_ and the mirror 
is in the state wo. In fact, there are infinite ways to 
re-write this entangled state. So, which represents the 
reality? For us, the world where the photon moves either 
horizontally or vertically is the reality since we have the 
ability to measure the photon’s position and the ability 
to measure whether the mirror has momentum or not. 
If a different kind of creature or instrument can make 
measurements according to A, and Be, then the right 
hand side of Eq. (0.32) is the reality. 


What kind of world that we perceive depends on our 
abilities of perception. These different abilities mathe- 
matically correspond to different bases. Suppose |®y) is 
the wave function for the whole universe. For one group 
of observers O, with a given set of measurement abilities, 
it can be decomposed in a set of basis as 





ur) = Oi) [Sap pee Oa) ee (0.34) 


These |®,)’s are the worlds perceived by Oy. For another 
group of observers Og with a different set of measure- 
ment abilities, the universe wave function can be decom- 
posed in a different set of basis as 


|®u) = |@1) + [G2) +--+ [B;) ++ 





(0.35) 


The worlds ©,’s are very different from the worlds |G,)’s 
It is possible that even the space-time that we are ex- 
periencing may look very different for another group of 
observers. 


Part III 
Abridgment of 
Everett’s long thesis 


This abridgment is done by mostly paraphrasing Ev- 
erett’s long thesis; Everett’s words in complete sentences 
are rarely used. Chapters in the thesis become sections 
in this abridgment. Everett used correlation or canoni- 
cal correlation to mean entanglement in his thesis; I use 
entanglement in the abridgment wherever correlation is 
meant entanglement in Everett’s thesis. In some cases 
the mathematical notation of Everett has been updated 
to more modern style, such as in the use of Dirac bracket 
notation. The author’s words are indicated with italic 
font. 


I. INTRODUCTION 


An isolated quantum system is completely described by 
a wave function |). According to standard textbooks on 
quantum mechanicsthe wave function |y) can change in 
two fundamentally different ways [15] 


Process 1 Observation with respect to operator O that 
has eigenfunctions |¢1) ,|¢2) ,|¢3) ,--- will trans- 
forms discontinuously the wave function |y) to 
one of the eigenfunctions, |¢;), with probability 


| (Pile) |? 


Process 2 Continuous and deterministic change of the 
state |) with time according to the Schrédinger 
equation 


F) P 
S|) = AI) (1.1) 


where A is the operator. 


Process 1 is commonly known as the collapse of wave 
function. 


The above scheme can lead to a paradox when 
there are more than one observer. Consider a room 
isolated in space where one observer A is to perform a 
measurement on a system S and will record the result 
in a notebook. The observer A is aware that the system 
S is in a quantum state |) that is not in an eigenstate 
of the measurement. Another observer B is outside of 
the room. Beside knowing the quantum state |W) and 
A is to perform a specified measurement, B has no 
interaction at all with the room and everything inside 
the room. The observer A performs the measurement 
and records the result in the notebook. One week later, 
B enters the room and performs his measurement, that 
is, taking a look at the notebook. A and B soon find 
themselves disputing each other: A insists that Process 


1 (the collapse of the wave function |W) occurred when 
he performed the measurement. B is confident that the 
whole room should evolve according to Process 2 for 
one week. Process 1 occurred only when he enters the 
room and performs his observation by looking at the 
notebook. There are five different ways to resolve the 
paradox or the dispute between A and B. 


Alternative 1 To postulate that there is only one ob- 
server in the universe. 


Alternative 2 To limit the applicability of quantum 
mechanics: quantum theory fails when it is applied 
to observers, measuring devices, or more generally 
any system of macroscopic size. 


Alternative 3 To deny the possibility of the outside ob- 
server B could ever be in possession of the state 
function of A and S$, where A is the observer in- 
side the lab and S' is the quantum system that A 
measures. 


Alternative 4 To abandon the position that a wave 
function is a complete description of a system. 


Alternative 5 To assume that the universal validity of 
the quantum description by the complete abandon- 
ment of Process 1, i.e., the collapse of wave func- 
tion. 


Alternatives 1 and 2 are clearly hard to defend. Alter- 
native 4 can be viewed as hidden variable theory. Local 
hidden variable theory has been refuted by Bell’s inequal- 
ity [16). Alternative 3 is a bit ambiguous, at least in my 
opinion. No matter what, the first four alternatives need 
additional assumptions. In contrast, alternative 5 has 
many advantages: 


e It relies on two basic ingredients of quantum me- 
chanics: (1) The wave function |y) in a Hilbert 
space offers a complete description of a quantum 
system; (2) |W) evolves unitarily according to the 
Schrodinger equation. 


e The quantum theory applies to the entire universe. 


e Measurement is no longer a special process and can 
be described as any other physical processes. 


The key for developing alternative 5 is to study compos- 
ite quantum systems and exploit the entanglement ( or 
correlation in Everett’s own words) between subsystems. 


II. PROBABILITY, INFORMATION, AND 
CORRELATION 


This section (or chapter as used in Everett’s thesis) 
offers a very general mathematical treatment of infor- 
mation and correlation, which is used in later sections 
to define correlation (or entanglement) between different 
quantum subsystems. 


A. Finite joint distribution 


For a collection of finite sets, ¥', ¥?,---,4", we can 
define a joint probability distribution, P(x}, ee aR), 
where x; € ¥',x7 € X?,---,a% € A”. This is the prob- 
ability that events aj},23,---,2% occur simultaneously. 
We can also define the marginal distribution 


NG+1 VI+2 |... en 
(2.1) 


where the summation is over all possible elements in 
XI HIF? ...., 4H". This is the probability that events 
©; ,2%,**+,@, occur with no restrictions on other sets 
XItl XI+2..., 2X". The conditional distribution is de- 


fined as 








1 22 
Pian, on (is TR, rd) 
A. Ad J j+l 
_ PO Cig ys seen) (2.2) 
= aH 
Pla, a) 

which is the probability that events x}, xZ,--+, x} occur 
while other variables are fixed at 2t!,---,a7?. 


For any function f(aj,23,---,2%) defined on sets 


X1,X?,..-,X"”, its expectation is defined as 


X12 eM 
(2.3) 
where the summation is over all possible values in sets 
1, X2,..-,X". Two variables ¥! and ¥? are indepen- 


dent if the joint distribution P(aj,27) = P(#;)P(27). 


B. Information for finite distributions 


For a single random variable ¥ with distribution 
P(x;), its information I is defined as 


Ix =} / P(wi) In P(x) = (In P(a)) . (2.4) 


This is just the negative of Shannon’s entropy. If ¥ has 
m. different values, the maximum of Jy is zero and the 
minimum of Jy is —lnm. The former corresponds to the 
case where one value, say x;, has P(x;) = 1 and the other 
values have P(x # x;) = 0. The latter is the case where 
every value has the same probability P(x;) = 1/m. This 
definition can be easily generalized for many variables 
pr oe ae 
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(In P(aj,23,--+,2%)) : (2.5) 


Similarly, one can also define information for the condi- 
tional distribution. It is clear that if all the random vari- 
ables ¥!, X?,---,4” are independent from each other, 
we have 


Iy142,.,.4¢n = Ix: + Iy24+---+tTyn. (2.6) 


C. Correlation for finite distributions 


For two random variables VY and JY, the correlation 
between them is defined as 


OX, Y) = Ley —Iy— Ip. 


It is clear that we have C(¥,Y) = 0 if two ran- 
dom variables ¥ and Y are independent. This defini- 
tion can be generalized to group correlations. Suppose 
we have groups of random variables, ¥!,X?,---,4"; 


(2.7) 


V1 Y?, +++, V™; «+ Zl, Z,---,Z*, the correlation be- 
tween these groups is 
CO hy AE yo ee re, -, 2°) 
— Dyk MY ee VM pe Zh ee Zl 
—Iyi...¢n — Tyr..jym — +++ — Iga ge. (2.8) 


A special case of this group correlation is 


C(41, XB”) = Ty y2.... 30 
Iy1 —Ipx2—+++—TIyn. 





(2.9) 


D. Generalization and further properties of 
correlation 


We shall now generalize the definition of correlation to 
joint probability distributions over arbitrary sets of any 
cardinality. To do this, we consider the refinement of a 
finite distribution. Consider a random variable ¥ con- 
sisting of finite number of events {x1,22,---,2n}. It is 
possible that the event x; is actually the disjunction of 
several exclusive events {%;,1, %,2,---,%i,,.}. The distri- 
bution P’(%;,,) is called a refinement of the distribution 


Paes: P'Gip)s (2.10) 


where the summation is over all possible values of v for a 
given x;. This can easily be generalized to multiple vari- 
ables. For a distribution P(x;,y,;) of two random vari- 
ables Y and y and its refinement P(%;,,,9;,,), there ex- 
ist two correlations C(4’,) and C’(#, Y), respectively. 
There is an interesting and important relation between 
these two corrections 


CO VIS CR): (2.11) 


With this relation, we can generalize the correlation to 
any probability measure over continuous variables. 


For simplicity, we consider two continuous random 
variables ¥ and Y and a probability measure M(4, y) 
over their cartesian product. We can divide % into finite 
subsets 4; and Y into finite subsets V;. This naturally 
leads to a probability distribution P,(%;,Y;), which can 
be obtained by integration of M(#¥,Y) over these sub- 
sets. With P\(4;,;), we can compute the correlation 
C,(X,¥Y) between ¥ and Y. By further dividing the sub- 
sets ¥; and Y,;, we can have another correlation C2(¥, )). 
By repeating the process, we have a sequence of correla- 
tions 


CAVA VSS ORY: a9) 


As a result, the correlation between two continuous ran- 
dom variables ¥ and ) is defined as 


C(¥X,Y) = lim Cr(X,Y), (2.13) 
where n — oo means that the division becomes finer and 
finer, approaching the continuous limit. 


Suppose that f is a one-one map, U = f(4), and g is 
a one-one map, V = g(V). We have 


C(x, ¥) =CWU,Y). (2.14) 


This shows that the correlation is invariant under one- 
to-one transformation. 


E. Information for general distribution 


For a random variable ¥ with a finite set of values 
{zi}, we assign a positive number a; to each value 2;. 
These a; are called information measure. If the probabil- 
ity distribution is P(x,), its information relative to this 
information measure is defined as 


Ix = ss P(a;) In Pees) = (In Ey) 


ay 





y. (2.15) 


For multiple variables, say, V,,2Z, with information 
measures {a;}, {bj}, {cx}, respectively, and a joint proba- 
bility distribution P(2;, y;, z,), their information relative 
to these measures are 

P Li, Uj, Hk 
Ixyz = S > P(xi,yj,2%) In PSS) : 


ajdjCp 


(2.16) 
ijk 


The previous definition of information is a special case 
where all values of {a;}, {b;}, {cx} in the informa- 
tion measure are unity. Interestingly, the correlation 
C(X,Y, Z) is independent of information measure. 


The advantage of introducing information measure is 
that we can now generalize information for continuous 


variables. For example, for a continuous variable, 7, 
with a probability distribution P(x), we can divide it 
into finite sets VY; and use ps; for the Lebesgue measure 
of the set ¥;. We then have 


i= oS P(a;) In EA) ‘ 


Mi 





(2.17) 


where P(x;) is the probability over the set 1;. We can 
further divide and refine the sets ¥; and define informa- 
tions correspondingly. These informations form a series 
which has an upper limit. We define this upper limit as 
the information for Y with probability distribution P(x) 


Ix =limIk. (2.18) 
iv 


Ill. QUANTUM MECHANICS 


Quantum mechanics has two basic ingredients: (1) the 
states of a quantum system are vectors in a Hilbert space; 
(2) the time evolution of an isolated quantum system is 
given by a linear wave equation. One crucial question 
is whether we need more to relate quantum mechanics to 
our experimental and daily experience. Many physicists 
represented by von Neumann think that we need at least 
one more ingredient, Process 1, which was mentioned at 
the beginning. Everett thinks that no more ingredient (or 
assumption) is needed. 


A. Composite quantum systems 


Consider a pair of quantum systems $; and S92. If their 
Hilbert spaces are H, and Hoe, respectively, the Hilbert 
space of the composite system S = S; + Sp is Hi @ Ho. 
If |€;) is a complete orthonormal set for H; and |7;) for 
Hy, a general state of S = S; + Sj can be expressed as 


|v) = eis fi, 3) , (3.1) 


where |&;,7;) is a shorthand for |£;) ® |n;). The concepts 
introduced in the last section can be applied here. Let A 
be a Hermitian operator on S$; with eigenfunctions |¢;) 
and eigenvalues 1; and B be a Hermitian operator on $2 
with eigenfunctions |y;) and eigenvalues v;. Then 


Piz = | (hi, G5 |W) |? 


is a joint square-amplitude distribution of the quan- 
tum state |W) over A and B. Note that Everett did 
not use probability distribution here. The physical mean- 
ing of square-amplitude is discussed later. It has two 
marginal distributions 


P(¢;) = oe ($8, 95|¥) 7, 


(3.2) 


(3.3) 


and 


P(pi) = Ds | (di, esl ¥) |? (3.4) 


Correspondingly, there are two conditional distributions 
P;(di) = Pig/P(9;) , (3.5) 


Pi(p3) = Pis/P(¢i) - (3.6) 
These distributions can be used to compute the marginal 
and conditional expectations of A or B. 


A key concept introduced by Everett is relative state. 
For a given state |7) in S2, there is a corresponding rel- 
ative state in Sj, 


\bn) = Ny (nl W) = Nin DS Cis |&) (nlny) 5 (3.7) 


where N,, is a normalization constant. For a given 
state |7), its relative state |¢,) is clearly unique and 
independent of |€;) and |n;). The relative state |¢,) can 
be used to compute expectation of any operator A on 
Si conditioned by the state |7) in Sb. 


If |7) is one of the basis states |7;), we have 


lons) = Nn; os cig |&i) - (3.8) 


It is clear that 


MW) =D a Hon) ln) (3.9) 


Two different relative states |@,,) and |d,) are not nec- 
essarily orthogonal 


(Gnjlome) = NaN Seen #0. (3.10) 
a 
In a general state |V) of S, the subsystem S$) can not 
be described by a single state but by a mixture of states. 
We usually use the density matrix to describe this kind 
of mixture. For the whole system, it is always in a pure 
state |W) and its density matrix is 


p=|¥) (W]. (3.11) 


By tracing out the subsystem S2, we have the density 
matrix for the subsystem 5} 


1 
pr = Sd casey |G) Eel = 0 ape Ions) (nal - 
tk gj j nj 


(3.12) 
Similarly, we can define pz. The most important con- 
clusion of this section is that it is meaningless to ask the 
absolute state of a subsystem — one can only ask the state 
relative to a given state of the remainder of the system. 
What Everett is discussing here is of course entangle- 
ment: in a composite system where the subsystems are 
entangled, the subsystems are described by density matri- 
ces not pure quantum states. 
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B. Information and correlation in quantum 
mechanics 


Consider an operator A, which has eigenstates |€;) with 
eigenvalues y;. The information of this operator in a 
given state |7)) is defined as 


Tab) = 971 (él) Pm | (bb) (3.13) 


The operator A has been assumed to be non-degenerate. 
If A is degenerate, that is, for eigenvalue j1;, there are 
multiple eigenstates |€;,,) (A = 1,2,---,m4), its infor- 
mation is defined as 


Tad) = Du = | (G.al¥) ') in Zeal ise 
» vi 


i 
(3.14) 
For convenience, we introduce projection operator 


P, =~ [éa) (Gal (3.15) 
aN 


with which we have 


S71 Exaly) ? = Tr(Ps), (3.16) 
oN 


where p = |w) (|. The information of operator A now 
has a concise form 


Tr(pP:) 


M4 





I4() = Ds Tr(pP;) In (3.17) 


We consider again the composite system S = S; + So. 
For the operator A that acts only on S$, and B only on 
Sp, we assume for simplicity that both of them have no 
degeneracy. Their joint information is 


Tap(@) = >> Tr(pPfP?) in Tr(pP#P?), (3.18) 
aj 
where P# = |&) (&|, P? = |nj) (njl, and p = |W) (Uj. 


The marginal informations for the operator A and the 
operator B are 


I4(W) = S° Tr(piP#) n Tr(pi PA) (3.19) 


and 
Ip(¥) = S— Tr(p2P?) In Tr(p2P?), 
J 
where py = Tro(|W) (Y|) and pg = Tri(|V) (|). We 
define the correlation between A and B as 


Cap(¥) = Ia.p(¥) — La(Y) — In (¥) 


Tr(pPAPP? 
= S/ Tr(pPAP?) In ie iP3) a 


(3.20) 





(3.21) 


Without loss of generality, we assume that the dimen- 
sion of the Hilbert space H is equal or bigger than Ho. 
The reduced density matrix p2 for the subsystem 5S is a 
Hermitian matrix and it can be diagonalized with non- 
negative eigenvalues. Suppose that its eigenvectors are 
|y;) with eigenvalues v;. If |€;) is the complete basis of 
the subsystem S$}, the relative state of |y,;) for a general 
state |W) is 


Ibs) = Moy DIG) Es ei1¥) (8.22) 


One can show that |¢,)’s are orthonormal to each other, 


(b;lbn) = NoNow > (Eil&) (is 5/¥) (Gl, ve) 
il 


= Na Nee » (£i, 95 |U) (WE, Pe) 


= No Nox S (yj|P2ler) = 45,k - (3.23) 


Note the difference between here and Eq.(3.10), where 
|n;)’s are not the eigenstates of p2. As a result, we have 


|v) = ye 1b, 5) - (3.24) 


This is called canonical representation of |W) by Ev- 
erett and it is of course just the Schmidt decomposition. 
Now let us choose A = p; and B = po, and assume that 
there is no degeneracy in eigenvalues v;. We have 


Tp.po(¥) = Ip, (¥) = Ip.() = Sl ojInvj (3.25) 


Consequently, we have 
Cox,o2(¥) = — > yj ny; 
J 


= —Tr(p1lnp,) = —Tr(p2 In p2). (3.26) 
This special correlation is called canonical correlation by 
Everett and it is, of course, exactly entanglement. For 


convenience, we let C(V) = C,, p.(W). One may conjec- 


ture that, for any pair of operator A on S; and operator 
B on S$, the following inequality holds 


Cap(¥) <C(w). (3.27) 


This conjecture has now been proved rigorously by 


Donald (7. 


For operators @ and k= p/h, there is the Heisenberg 
uncertainty relation 


AxzAk > (3.28) 


ele 
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Everett conjectured that in terms of information this re- 
lation can be written as 
1 
I, +Ip <In—. (3.29) 
Te 


This conjecture has been proved in Ref. [18), where Ev- 
erett was not acknowledged. 


C. Measurement 


Everett regarded measurement as a natural process 
in quantum mechanics and there is no fundamental 
distinction between “measuring apparatus” and other 
physical systems. For Everett , a measurement is 
simply a special interacting process between two quantum 
subsystems, which results in the end that the property of 
the measured subsystem is correlated to a quantity in the 
measuring subsystem. The measuring process has two 
characteristics that distinguish it from other interacting 
processes. 


Suppose that we have two subsystems S$; and So, ini- 
tially in a product state |Wo) = |wWo, do) = |W) @|¢). The 
system will evolve dynamically under a Hamiltonian H of 
the whole system. According to the analysis in the above 
subsections, at any moment, the overall state |(t)) can 
be decomposed canonically as 


[W(e)) = Ds Ibj(t), 5(0) (3.30) 


where |7,(t))’s and |¢,(t))’s are eigenfunctions of two op- 
erators A(t) and B(t), respectively. The Hamiltonian H 
is said to generate a measurement if the following limits 
exist 


Aco = lim A(t) , 


t>0o 


Boo = lim B(t) 


t-0co 


(3.31) 
and they do not depend on initial conditions. 


There is one requirement for a Hamiltonian H to 
generate a measurement: H does not decrease the 
information in the marginal distribution of A. This 
means that if initially |Yo) = |¢,¢0) where |¢) is an 
eigenfunction of A, we should have at any time that 
|W(t)) = |¢, (t)). The requirement is necessary for the 
repeatability of measurements: if a spin is measured to 
be up along the z direction, it should be still up when 
we measure it again along the z direction. 


In sum, a Hamiltonian H _is said to generate a 
measurement of A in S; by B in Sp if the following 
two conditions are satisfied: (1) the correlation C4,p 
increases to its maximum with time; (2) H does not 
decrease the marginal information of A. 


We now turn to a model proposed by von Neumann [15] 
to illustrate the above definition of quantum measure- 
ment. This model consists of a particle of one coordinate 
@ and an apparatus of one coordinate 7 (which may rep- 
resents the position of a meter needle). The interaction 
between them is very strong so that we neglect all the 
kinetic energies. This means that the whole Hamiltonian 
is given by 


x . O 
Hy = thas ‘ (3.32) 
If the initial condition is a product state 
Wo) = / dq6(a) la) | drn(r) |r) 
Z dgdro(a)n(r) lar) , (3.33) 


it is straightforward to find the evolution of this state 


p(t) = f agaro(ayn(r — atlas). (8:34) 
Let us consider a special case n(r) = d(r — ro), that is, 
the apparatus needle initially points to a definite position 
ro. In this case, we have 

w() = f dasa) |aro+at). (3.35) 
It is clear that Hy has kept the marginal information of 


g. Let us consider the correlation C,,,(t). Initially, we 
have C,,,(t) = 0. At time t, we have 


Cast) = Lag tt) a I, (t) — I,(t) 
= th pee / dgld(@n|o(@)2- (3.36) 


The correlation C,,,(t) has increased to its maximum as 
soon as ¢ is not zero. The above analysis clearly shows 
that the Hamiltonian H; generates a measurement of 
qd for the system by f of the apparatus. The general 
case that the apparatus needle has no definite position 
initially is more complicated and was discussed in the 
long thesis by Everett . 


In the above discussion, the apparatus initially has a 
definite position rg. After measurement, the apparatus 
no longer has a definite position. In fact, according to 
Eq. (8.35), the apparatus is in a superposition of states 
of different positions and the probability of its position 
at ro + qt is |¢(q)|?. If this apparatus is of macroscopic 
size, this means that its meter needle does not point 
to a definite position. We of course have never seen 
this kind of measurement in any laboratory or similar 
phenomena in our daily life. To resolve this dilemma, 
one possible way to assume that the mysterious collapse 
of wave function (Process 1) during the measurement. 
Everett found that one can resolve this dilemma within 
the framework of quantum mechanics without additional 
assumption. 
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IV. OBSERVATION 


Observers are introduced as purely physical systems 
and are treated completely within the framework of quan- 
tum mechanics. In other words, observers are simply 
usual quantum systems. If this treatment is successful, it 
should build a consistent picture between the appearance 
of phenomena, i.e., the subjective experience of observers, 
and the usual probabilistic interpretation of quantum me- 
chanics. 


A. Formulation of the problem 


One can regard an observer as an automatically func- 
tioning machine that has sensors and the capacity to 
record or register past sensory data and machine con- 
figurations. When an observer O has observed the event 
a, it means that O has changed to a new state that de- 
pends on a. Observers are assumed to have memories; 
the subjective experience of an observer is related to the 
contents of its memory. As a result, the quantum state 
of an observer O should be written as 


WiA.p,.--.c)) (4.1) 
where A, B,---,C represent memories in the order of 
time. Sometimes [---,A,B,---,C] is used to indicate 


the possible previous memories that are not relevant for 
the current observations. 


Consider an observer O who wants to measure (or ob- 
serve) the property Aofa system S. The eigenfunctions 
of A are |¢;)’s. Initially, the system S is in one of the 
eigenfunctions |@;)’s of A and the observer is in state 
Wey) A good observation is defined as the one that 
results in transforming 


[wPFO) = 15) @ WP) = 1b55 HP) (4.2) 


to 
I+) = 103) © IP. a1) = 1855 8P. 


The semicolon is used here and will be used to delimit 
the system state and the observer state. The requirement 
that the system state |¢;) is unchanged is necessary if 
you want the observation is repeatable. It is clear that 
observation is just quantum measurement (introduced in 
the last section) with memories. 


(4.3) 


B. Deductions 


If the system initially is in a general quantum state 
described by >7; a; |¢;), the final total state after a good 
observation is 


Pot) = S45 1655 ¥P.au)) - (4.4) 
j 


This follows directly from the superposition principle 
and is consistent with the general framework of quantum 
mechanics. Two features stand out in the above equa- 
tions. (1) There is entanglement between the system 
and the observer and, as a result, neither of them has its 
independent state. (2) The result seems to contradict 
our daily experience. On the one hand, the final states 
are superposition of many different states, each of which 
corresponds to a definite observation outcome; on the 
other hand, there is only one outcome in our daily 
experience. 


Here comes Everett’s genius. Everett thinks that each 
superposition element in Eq. (4.4) represents a “world” 
and the observer observes different outcomes in different 
“worlds”. Since the quantum dynamical evolution is lin- 
ear, which respects the superposition, each world evolves 
on its own and in each world the observer experiences 
only one definite outcome. This is in accordance with 
our daily experience; at the same time, no additional 
assumption, such as the collapse of wave function, is 
needed. This is the so-called many-worlds interpretation. 
However, Everett himself never called each superposition 
element “world”; “many-worlds” was coined by de Witt 
in 1970s [1]. Everett call it branch. 


The above observation should be the same even in 
the presence of other systems which do not interact 
with the observer O. We thus have the general rules of 
observation. 

Rule 1 - The observation of a quantity A, with eigen- 
functions \s)F, in a system 5S by the observer O, trans- 
forms the total state according to 


[b5, p2, oie pes bP.4) 
= S14; [opt Peewee ees? a) ? (4.5) 
J 


where |e) , |gv52),---,|¢5") are the initial quantum 
states for systems $},52,---,S,, respectively, and 


aj = (6710). 


Rule 2 - Rule 1 may be applied separately to each 
element of a superposition of total system states, the re- 
sults are superposed to obtain the final total state. Thus, 


a determination of B, with eigenfunctions |¢,)°?, on S2 
by the observer O transforms the total state 
Ss ns 
Sa; 1971, w+, pd WE a,}) (4.6) 
J 


to 


S > a;be \oF, or, a AEP LANE eb) ; (4.7) 
jk 


where by = (o,? |W). These two rules follow directly 
from the superposition principle and are consistent with 
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the general framework of quantum mechanics. 


Consider again the simple case where there is one 
system and one observer. The observation results in 
Eq. (4.4). If one repeats this observation, according to 
Rule 2, the total state becomes 


S- 95 1855 VE ej,c4]) - (4.8) 
j 


Each superposition element in the above now describes 
that the observer has obtained the same result for 
both observations. That is, in each world, the observa- 
tion is repeatable. This is consistent with our experience. 


Let us go one step further by considering many differ- 
ent systems which are initially in the same state 








[boy = |p) ee) = Sl a;le;)- (4.9) 
J 
Therefore, the initial state of the total system is 
oe sar sero) = ay : pe, aca we"; bey) ; (4.10) 


The measurement is performed on the systems in the 


order $1,52,:--,S,. After the first measurement on Sj, 
we have 
S1+So+--+5,+0 1,8 Sn. 9/0 
|wyt ? y= a; |¢5,0 Pee DW SDE at) - 
J 


(4.11) 
The total state after the second measurement is 


em aaa 


S- ajak |e}; b%, w?, a) pons DE. a2) (4.12) 
J 


After r <n measurements have taken place, we have 


pSitS2t--+5n +0) = ye Aj AK +++ Ae 
J 


oj; dR, Pen Qo; port. caer, Pes OE. se e208) : 
(4.13) 


Each of the superposition elements, which is one of the 
many worlds, describes an observer which has observed 
an apparently random sequence of definite results repre- 


sented by [---,aj,az,°--, af]. If one repeats the mea- 
surement on the system S$ (m <r), the observer would 
get a memory sequence of [---,aj,--+,ay",--+, af, a7]. 


In each world, the observer feels the “collapse” of wave 
function. 


To make sense of the coefficients a;a, --- ag before each 
superposition element, we need to assign a measure to 
them. We first consider a superposition state 


lb) = yay |d;) . (4.14) 


To assign a measure, we first need that each element 
is normalized (¢;|¢;) = 1. For each element |¢;), the 
assigned measure is M(a,;), which is a non-negative 
function. We could change |¢;) to e’?|¢;) and a; to 
e°q,;, then the measure M(a;) assigned to this element 
becomes M(e~%a;). However, physically nothing has 
been changed. Therefore, we need M(a;) = M(e~‘®a,). 
For this to be true, it is clear that M(a,;) should depend 
only on the amplitude of a;, that is, M(|a;|). 


In reality, due to the accuracy of the measurement 
or other reasons, we often regard a group of states 


lbp) ,}p+1) ,°++,/bq) as the same, ie., 
q 
G1) = S05 |45) , (4.15) 
Jj=p 


where (d|d) = 1 is normalized. We require the additivity 
for the measure, that is, 


M(al) = S>M(a)) (4:16) 
(4|4) = 1 implies that 
al? = Solas? (4.17) 
and - 
M((|a;|) - (4.18) 





The only choice is the square amplitude measure, 
M(|a;|) = cla;|?, where c can be fixed by requiring 
do; M(la;|) = 1. 

This square amplitude measure M is of probability 
nature. Everett discussed this for general cases. I’ll use a 
simple case to illustrate. Let us consider a simple system 
of spin-1/2. Suppose that there are n copies of them and 
their states are the same 

We) = |v) = 

v2 


where |u) is for spin up and |d) is for spin down. We 
make observations of the spins of 6*. If the spin up |u) 
is registered as 0 and the spin down |d) is registered as 1, 
we have after measuring all the spins 


weet) = —_ > 


1 2 rr rtl 
luv, d°,---,d™u ane 





yn) (lu) + |d)), (4.19) 








. gas cpa carb coum) ; 
(4.20) 


where the summation is over all possible sequences of 0’s 
and 1’s of length n. Most of times, we only care about 
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how many of these spins are up and how many of them 
down. So, we group these states according to how many 
Spins are up 


n! 


aie - 1 
mGsmme Ge oy re Ee) 


m ups 


where |@™) represents a state where m of the n spins are 
up. The measure M for the group of m up spins ts 


n! 
m!(n — m)!2”’ 


Min = (4.22) 


which is exactly the probability that one observes the state 
of spin up m times when making n repeated same mea- 
surements. What happens here is that, after n measure- 
ments, it splits into 2” branches of worlds, each of which 
is equally probable and has a different sequence of 0’s and 
1’s registered. The chance being in a world where there 
arem up spins (orm 0s) is Mm. In general, if the spin 
is in a state |b) = alu) + b|d), we still have 2” branches 
of worlds after n measurements but the chance being in 
a world where there arem up spins (orm 0’s) is 


2m |p)2(n—m) 
—_——. b : 4,23 
ile (4.23) 
The number of branches has nothing to do with the prob- 
ability measure |a|? or |b|?; it depends on the observation 
outcomes. 


The above results can be straightforwardly generalized 
to the cases where different measurements are performed 
on different systems and different measurements are per- 
formed on the same system. 


C. Several observers 


It was pointed out at the beginning that the assumption 
of Process 1 (or the collapse of wave function) would 
lead to self-contradiction when there are more than 
one observers. There is no such contradiction in the 
many-worlds theory. Let us consider the situation 
where there are multiple observers. Three different cases 
are to be considered. 


Case 1: Two observers observe the same quantity in 
the same system. 


Observers O; and Oz are to observe the quantity A for 
the system S that is in the following state 


WS) = Do aylds) (4.24) 


where |@;) is an eigenstate of A. The observer O,; makes 
the first observation; we apply Rule 1 to the initial state 


|Vo) = |v; wes wer) and obtain 


\P1) = Sag log ber ap Vey) (4-28) 
Fl 


The observer O2 makes the second observation; we apply 
Rule 2 and obtain 


dG (ules) 1635 OE ag YE nl) 


|W2) 


I 


j,k 
So 45 [653 Oe aap De? ea] (4.26) 
J 


This shows that the first observation by Oj, leads to 
splitting of different branches of worlds, and the second 
observation by Og of the same quantity causes no 
splitting and furthermore Oz observes the same result 
as O,. It is in accordance with our daily experience: 
two observers measuring the same quantity on a given 
system always obtains the same result. This result can 
clearly be generalized to any number of observers. 


Everett even considered the situation where the two 
observers are allowed to communicate their observation 
results. It does not lead to any self-contradiction and 
contradiction to our daily experience. 


Let us now return to the paradox in Section I. The 
observer A made the measurement; the observer B did 
not make the measurement directly and he obtained 
his result by reading A’s notebook. In this case, we 
have »; 05 1955 DE ces] Posey]? where the subscript 
indicates that the result comes from A. In each world, 
the two observers A and B always agree with each other 
and have nothing to argue about. The paradox is resolved. 


Case 2: ‘Two observers measure separately two 
different quantities, which are non-commuting, in the 
same system. 


The same initial state |V)) and the same observation 
by O1. Then the observer O2 measures the quantity B, 
which does not commute with A. We apply Rule 2 to 


|W,) and obtain 


Wo) = Say (Gelb s) 1055 VE a) VE Be] > 


jik 


(4.27) 


where |y,) is the eigenstate of B. In this case, the 
second observation leads to further splitting. If A has 
Na eigenstates and B has Ng eigenstates, then there are 
NaNg different worlds in total, which are represented 
by the terms on the left hand side of the above equation. 
The measure M of the coefficients a; (yx|¢;) gives 
the probability of getting into one of the worlds if the 
observations are repeated on the same system in the 
same state. 
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Case 3: Two observers O; and Og measure two 
entangled systems S; and Sz: O; measures A in S$; and 
Og measures B in Sb. 


For simplicity, we assume that the initial state of the 
composite system of $; and S»2 is entangled 


N 
pare) = S> a; 105, P3) » (4.28) 
j=l 


where N < Ny, Ng. There is no interaction between 
S; and S_ during the following observations. The total 
initial state is 


Wo)’ = [eb F%25 Ps bra) . (4.29) 
After O, observes A in Si, the total state becomes 


N 
|B)’ = 0 a5 14), 975 BE agp YEA) - 


j=l 


(4.30) 


There are now N different branches of worlds. The ob- 
server Oz then observes B in Sz and transforms the total 
state to 


N 
2) = 32 a3 |b5, P95 O Ea Ye ay) - 


j=1 


(4.31) 


No more splitting and there are still N different branches 
of worlds. In each branch, when O, observes the result 
represented by |¢;), O2 observes the result represented by 
\y;). The observation results of O; and O2 are correlated. 
It is easy to check that if Og observes first and O, second, 
the end state is still |W)’. It is clear that the observations 
of O; and O2 do not influence each other. Furthermore, 
if O; repeats its measurement of A in Sj, then the total 
state |W)’ is turned into 


N 
[Woy’ = So 05147, 255 DP ay ag) Yea): (432) 
j=l 


We would have the same total state if O; had observed 
A in S; twice in a row before Og observed B in Sg. This 
shows that in every world O; would not know whether 
Oy has made an observation on 5S» or not if there is no 
direct communication between them. In other words, 
the entanglement in the state can not be used for 


communication. 


V. SUPPLEMENTARY TOPICS 


We have presented an abstract treatment of measure- 
ment and observation completely within the framework 
of quantum mechanics, which are in correspondence with 
our experience. Upon observation and measurement, 
there is splitting into different worlds: in each world to 


an observer there appears a collapse of wave function (or 
Process 1); however, with all the worlds combined, the 
evolution is always unitary. This approach has at least 
three advantages: (1) it is logically self-consistent; (2) it 
does not involve any additional assumption, for example, 
the collapse of wave function; (3) it is completely quan- 
tum mechanical with no use of classical concepts. 


A. Macroscopic objects and classical mechanics 


In the many-worlds theory, there is no more divide be- 
tween quantum world and classical world. Macroscopic 
objects are also described by wave functions. However, 
we do experience in our daily life a classical world where 
macroscopic objects have definite positions and mo- 
menta, moving around according to classical mechanics. 
Below is a rough explanation how the classical world 
emerges from quantum mechanics with no detailed proof. 


Let us first consider a simple case, the hydrogen atom. 
Its wave function is essentially a product of a centroid 
wave function and a wave function for the relative coor- 
dinate between the proton and the electron. The former 
describes the motion of the hydrogen atom as a whole in 
space and time; the latter is usually a bound state that 
gives us the size and shape of the hydrogen atom. The 
situation is similar for macroscopic object that we see 
daily. For example, the wave function of a cannonball 
can be written roughly as 


Itbe) = |g(X)) ® WK) = 1g(X), 0X) 


where |g(X)) is a Gaussian wave function well localized 
at the position X and |7%) is the bound state giving us 
the size and shape of a cannonball located at X. When 
the centroid wave function |g(X)) evolves over a long 
period of time, it can spread to occupy a large region of 
space. Therefore, in the general case, the cannonball is 
not necessarily well localized and its state is given by 


(5.1) 


jue) = f a(x) @ Wh) ax, (5.2) 
where |a(X)) is any smooth and normalizable function. 
In this general state, there is an entanglement between 
the centroid position X and the rest of the coordinates 
of a cannonball. 


However, in most of systems, we can separate the cen- 
troid coordinates from the coordinates for the relative mo- 
tion in the Schrédinger equation; as a result, |) does 
not depend on X explicitly. In this case, we can take |) 
outside of the above integral and have |.) = |a) ® |W). 
When |a) is not well localized, the state |W',) represents 
a kind of “smeared out” cannonball. In contrast, we only 
see cannonballs with definite positions and momenta in 
our daily life. This dilemma can be resolved by notic- 
ing that cannonballs are never truly isolated and they are 
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constantly be observed or measured by photons and other 
objects. We assume that someone magically set up a can- 
nonball in a superposition state of two well separated and 
localized Gaussians, that is, 


1 
v2 


The initial state of this cannonball and its observer is 
were) — er WP)» where Wea) is the state of an ob- 
server, which can be photons or other physical objects 
that can distinguish the difference between the positions 
X, and X2. After the interaction between the cannonball 
and the observer (which happens in a very short time), a 
splitting of worlds happens and the total state becomes 


[tbe) = = (19(X1)) + lg(X2))) @ bb) . (5.3) 


just) = 5 (lal Px) 


+ |9(X2); VE.,xa))) O14"). (6.4) 
In one world, the observer finds the cannonball well 
localized at X1; in the other world, the observer finds 
the cannonball well localized at X2. This is why we do 
not observe “smeared out” cannonballs. The cannonball 
and other similar macroscopic objects with well localized 
wave functions will move approximately according to the 
classical mechanics. After a certain period of time, the 
wave packet will spread out so much to cause another 
splitting of worlds. The detailed account of how long 
a well-localized wave packet will follow the classical 
trajectory is given by Ehrenfest time [19]. Everett did 
not use the concept of Ehrenfest time in his thesis. 


B. Ampilication processes 


In our abstract discussion of measuring process in 
the previous sections, we have simplified the coupling 
between the system and the observer (or the apparatus). 
In reality, there is a chain of intervening systems linking 
a microscopic system to a macroscopic apparatus. Each 
system in the chain of intervening systems is correlated 
to its predecessor, resulting an amplification of effects 
from the microscopic system to a macroscopic apparatus. 


We use Geiger counter as an example to illustrate 
this amplification process. A Geiger counter con- 
tains a large number of gas atoms that are placed 
in a strong electric field. The atoms are metastable 
against ionization. More importantly, the product of 
ionizing one gas atom can cause ionization of more 
atoms in a cascading process. This chain reaction 
correlates large number of gas atoms: either very few 
or very many of the gas atoms are ionized at a given time. 


To put the above discussion in a mathematical form, 
we write the state of a Geiger counter in terms of its 


individual gas atoms 


le) = 'S Cij---k Pir jy 


ijenk 


where |¢;,@;,:--, x) represents a state where the first 
atom is in the ith state, the second atom is in the jth 
state, ..., the nth atom is in the kth state. The superpo- 
sition terms on the right hand side of the above equations 
describe either large number of ionized atoms or few ion- 
ized atoms. Due to the chain ionization, there are almost 
no terms for medium-sized number of ionized atoms. By 
choosing a medium-sized number, we can place these su- 
perposition terms in two groups 


ay wy) = D> aig... [bis O5s ++ On) (5.6) 
ijk 
and 
a2 |p) = Saige |di, bj,°°+, Oe) (5.7) 


The primed summation is over all terms with few number 
of ionized atoms and the double primed summation is 
over all terms with very large number of ionized atoms. 
|¥p}) and |¢17}) represent, respectively, two macroscopic 
distinguishable states of a Geiger counter, discharged or 
undischarged. As a result, the state of a Geiger counter 
can be simply written as 


|b?) = a1 |dpoq) + 2 |v) - 


Consider a particle which is detectable by a Geiger 
counter. The total initial state is 


[OETS) = [Ps by) 


where |?) is the state of the particle. If the wave func- 
tion |q)?) is not well localized so that it has a part |w?) 
outside of the Geiger counter and the other part |w?) in- 
side the Geiger counter, i.e., |W?) = a|w?) +b|w?). After 
the particle encounters the Geiger counter, the total state 
is transformed to 


[ORT = aldPs dy) + b/d? ; yp) - 


We have a splitting into two worlds: in one world the 
Geiger counter is discharged and in the other one the 
counter is undischarged. This is similar to the splitting 


in Eq. (5.4). 


(5.8) 


(5.9) 


(5.10) 


C. Reversibility and irreversibility 


In the usual treatment of quantum mechanics, there 
are both Process 1 (the collapse of wave function) and 
Process 2 (unitary evolution). It is obvious that Process 1 
is irreversible and Process 2 is reversible. This difference 
can be quantified by introducing another information 


I, = Tr(plnf), (5.11) 
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where /f is a density matrix of a quantum system. If 
the system changes according to Process 2, we have fp! = 
UpU', which does not change I p since 


Ip = Tr(é' Inf’) = Tr[U Au! In(UAu")] 


Tr[UflnpU"] = Tr(plnf) =J,. (5.12) 


For Process 1, we consider a simple case where the system 
is in a pure quantum state, that is, 6 = |w) (~|. The 
measurement is for the quantity A, whose eigenfunctions 
are |@;). After the measurement (Process 1), there is a 
probability of | (¢,;|q) |? of the measured result is |d;). 
This means that the density matrix becomes 


i = S71 oslv)P lea) (al (5.13) 


and 


Ty = > (eslb) Pin (I (e510) P) <= 0. (6.14) 


J 


So, Process 1 decreases the information J, but never 
increase it. One can prove rigorously that this is true for 
any p not just for pure states. 


In the many-worlds theory, even though only Process 
2 is recognized, an observer can still feel similar irre- 
versibility on the subjective level. When an observation 
is performed, it leads to a superposition of many differ- 
ent worlds. From this time forward, since the unitary 
evolution is linear, these worlds are parallel, evolve 
independently, and no longer influence each other. The 
observer in each world has only information in his world, 
knowing nothing about other parallel worlds. As a re- 
sult, for an observer in a given world, this process is also 
irreversible since he can not in principle get to know the 
state before the measurement based on the information 
available in his world. This irreversibility implies that 
there is a fundamental limit on the knowledge of the 
entire universe. 


However, the irreversibility discussed here appears not 
related to the second law of thermodynamics, which re- 
flects a different kind of irreversibility. There are two 
ways to see the difference. First, the former is of quan- 
tum nature while the latter is also valid in classical sys- 
tems. When one mixes two piles of sand of different 
colors, there is clearly no quantum process involved but 
the mixing is irreversible as dictated by the second law. 
Second, when the universe keeps splitting into more and 
more worlds, more and more systems get entangled to- 
gether. In this sense, the irreversibility associated with 
the world splitting is for an open system whereas the sec- 
ond law of thermodynamics is for a closed system. These 
are definitely not the final words on the relation between 
the second law of thermodynamics and the splitting of 
worlds, which warrants further study. In fact, Tegmark 
discussed the second law of thermodynamics within the 


framework of many-worlds theory using a tripartite par- 
tition of the universe [8). My personal view is that to dis- 
cuss the second law within quantum mechanics one has 
to follow von Neumann, who defined quantum entropy for 
pure states (different from the well-known von Neumann 
entropy) and proved quantum H-theorm : 


D. Approximate measurement 


In many situations, we have only approximate mea- 
surements, where the apparatus or observer interacts 
weakly with the system and for a finite time. It is hard 
to understand these cases with Process 1, which requires 
that all measurements result in a precise projection to 
an eigenstate of a measured quantity. The position mea- 
surement appears to be the best example to illustrate 
this difficulty. 


In any situation, we do not know the precise position of 
any particle. One possible way to understand this with 
Process 1 is that the measurement indeed results in a 
precise position but the observer has only imprecise in- 
formation. This view is clearly wrong. In practice, for 
example, when tracking high-energy particles with cloud 
chambers, we can measure the approximate positions of 
a particle successively. This means that we can predict 
approximately the position of a particle with its current 
approximate position. If Process 1 were true, after the 
measurement, the particle would be in an eigenstate of 
position and its momentum would be too uncertain to 
make any meaningful prediction for its future position. 
This contradicts well-established experimental facts. Ev- 
erett has offered more detailed analysis along this line and 
pointed out the inadequacy of Process 1 in approximate 
measurement. 


E. Discussion of a spin measurement example 


Consider the z component of a spin-1/2 with the Stern- 
Gerlach setup. In this measurement, a particle of spin- 
1/2 passes through a magnetic field that is inhomoge- 
neous along the z direction. The measurement is essen- 
tially to couple the spin and the orbital of the same par- 
ticle. For simplicity, we keep only the coupling part of 
the Hamiltonian and approximate only the constant and 
linear part of the inhomogeneous field 

Ay = po,(Bo + zB), (5.15) 
where yz is the magnetic moment of the particle. The 
initial state of the particle is assumed to be 


Yo) = bo(z)(crlu) +erld)), (5.16) 


where ¢9(z) describes a wave packet along the z direction 
and |u) (|d)) is the eigenfunction of 6, with eigenvalue 1 
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(-1). One can solve the Schrédinger equation. If At is 
the time that the particle takes to traverse the field, we 
have 


|W(At)) = ¢go(z) pee er eOsy lu) 


dgpettlBoteBi)At/n) |\d) . (5.17) 


This is an entangled state between the spin and the or- 
bital. The wave function has split into two: one with mo- 
mentum yf, At and the other with momentum —yH; At. 
With long enough flying time, these two parts will be- 
come well separated in space: the upper wave packet for 
the spin up state |u) and the lower wave packet for the 
spin down state |d). The measuring “apparatus” here is 
the orbital degree of freedom of the particle, which by all 
means 18 Microscopic. 

In many situations, one can regard states for composite 
systems such as Eq. (6.17) as a non-interfering mixture of 
states by ignoring phases in superposition elements. For 
example, it is correct when calculating marginal expec- 
tations for subsystems. For the state (5.17), it is alright 
to regard it as a mixture if one cares either the spin or 
the orbital but not both. The phase relations between 
different superposition elements are important. For this 
Stern-Gerlach system, it is possible to re-combine the two 
terms in Eq. (5.17) in another imhomogeneous magnetic 
field and restore the original state in Eq. (5.16) 23}. For 
this to happen, one can not disregard the phases. 


VI. DISCUSSION 


We have shown that our theory (the many-worlds 
theory) can be put in a satisfactory correspondence with 
experience, and gives us a complete conceptual model 
of the universe with more than one observers. In this 
theory, the wave function is a basic description of phys- 
ical systems, including observers, and the probabilistic 
assertion of quantum mechanics can be deduced from 
this theory as subjective appearances to the observers. 
This theory constitutes an objective framework in which 
puzzling subjects, such as classical phenomena, the mea- 
suring process, the inter-relationship of several observers, 
reversibility and irreversibility, can be investigated in a 
logically consistent manner. 


In light of his new theory, Everett discussed in length 
other interpretations of quantum mechanics existing at 
his time. They are 


a. The “popular” interpretation. The wave func- 
tion |W) changes continuously and deterministically 
with a wave equation when the system is isolated 
but changes probabilistically and abruptly upon 
observation. 


b. The Copenhagen interpretation. The wave func- 
tion |q) is regarded as just a mathematical ar- 


tifice which one uses to make statistical predic- 
tions. All statements about microscopic phenom- 
ena are meaningful only within a classical experi- 
ment setup. 


c. The “hidden variable” interpretation. The wave 
function |y) is not a complete description of a sys- 
tem. There are additional hidden parameters in the 
correct and complete theory that is to be developed 
in the future. The probability in quantum mechan- 
ics is the result of our ignorance of these hidden 
variables. 


d. The stochastic process interpretation. In this the- 
ory, physical systems are undergoing probabilistic 
changes at all times. The discontinuous and prob- 
abilistic “quantum jump” are not the result of ob- 
servation and measurement but are fundamental to 
the systems themselves. 


VII. APPENDICES 


In Everett’s long thesis, there are two appendices. In 
the first one, Everett offered detailed proofs for many 
mathematical relations in the main text. In the second 
one, he offered his view on theoretical physics in general. 
Here is the summary of the second appendiz. 


There are a number of interpretations of quantum 
mechanics, most of which are equivalent in the sense that 
they agree with all the physical experiments. To decide 
among them, we must go beyond experiments and discuss 
the fundamental nature and purpose of physical theories. 


Every theory has two separate parts, the formal part 
and the interpretive part. The formal part consists of 
a purely logico-mathematical structure that consists of 
a collection of symbols and rules for their manipulation. 
The interpretive part is a set of association rules that 
relate the formal symbols with the experienced world. 
There can be many different theories which are logical 
consistent and correct in explaining the perceived world. 
In this case, further criteria such as usefulness, simplic- 
ity, comprehensiveness, pictorability, etc., must be used 
to select the theory or the theories. In particular, sim- 
plicity refers to conceptual simplicity not ease in use. It 
is harmful to the progress of physics that a physical the- 
ory should contain no elements which do not correspond 
directly to what we observe. 


Part IV 
The legacy of Everett’s 
theory 


Although Everett published his short thesis in Review 
of Modern Physics, a well known and respected journal, 
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along with Wheeler’s supporting article [4 [5], his 
theory of the universal wave function had received little 
attention for many years [3] [24]. In 1962, Everett was 
invited by Podolsky to a small workshop at Xavier 
University, where he lectured on his theory for the first 
time in public. However, this did not mean that his 
theory was getting wide recognition; it only showed that 
his work was not completely forgotten [3]. 


Everett’s theory began to be noticed widely in the 
physics community only after DeWitt started to pro- 
mote it as the many-worlds theory around 1970s with 
Graham’s help [I] [3] 24]. As a result, Everett’s theory 
is now widely known as the many-worlds theory. Only 
a limited number of specialists know it as the theory 
of the universal wave function or the “relative state” 
formulation of quantum mechanics. Now Everett’s 
theory has to be mentioned in all serious discussion of 
the subject known as the interpretation of quantum 
mechanics [25]. 


To celebrate the 50th anniversary of its publication 
in 2007, the influential magazine, Nature, put the 
many-worlds theory on its July cover; and BBC pro- 
duced a special program called “Parallel worlds, Parallel 
lives”. This theory is now extremely popular among 
non-specialists. 


Among physicists, the many-worlds theory is still a 
minority view but it is gaining momentum. Hawking [26] 
and Gell-Mann were famous names among its early 
supporters. Current influential advocates include David 
Deutsch [6], Max Tegmark [8], and Sean Carroll [28]. 
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